Skip to content

fix transformers 4.52 device_map ddp #4424

New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Conversation

Jintao-Huang
Copy link
Collaborator

No description provided.

@Jintao-Huang Jintao-Huang merged commit ab41c74 into modelscope:main May 30, 2025
2 checks passed
tastelikefeet added a commit to tastelikefeet/swift that referenced this pull request Jun 4, 2025
…n3-emb

* commit '9dfa63a060ba6de6f53a0a00cf99c3025ea3fe18': (35 commits)
  Fix create checkpoint symlink & grpo omni (modelscope#4468)
  [grpo] fix base url (modelscope#4463)
  [train] Fix qwen2.5-vl use_cache (modelscope#4458)
  [seq_parallel] fix sp compute_acc (modelscope#4456)
  [dpo] support dpo padding_free/logits_to_keep & dpo compat trl==0.18 (modelscope#4394)
  [grpo] fix hang in colocate lora settings (modelscope#4451)
  [grpo] Two-Sided Clipping for GRPO Trainer (modelscope#4450)
  [grpo] support vllm_server_base_url for vLLMClient (modelscope#4449)
  [template] fix vlm padding_free/logits_to_keep (modelscope#4444)
  fix qwen2_5_vl awq (modelscope#4436)
  [megatron] support megatron num_train_epochs (modelscope#4432)
  fix emb docs (modelscope#4434)
  fix model_meta (modelscope#4431)
  [model] Support MiMo-VL (modelscope#4429)
  [dataset] add ms_logger_context (modelscope#4428)
  [dataset] fix self-cognition & load_from_cache_file (modelscope#4426)
  fix transformers 4.52 device_map ddp (modelscope#4424)
  Fix cmdline parsing error on Windows system (modelscope#4422)
  support DeepSeek-R1-0528-Qwen3-8B (modelscope#4417)
  [pt/sft] support use_logits_to_keep & support DeepSeek-R1-0528 (modelscope#4409)
  ...

# Conflicts:
#	README.md
#	README_CN.md
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

2 participants